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QUANTILE  BASED  UNIFIED  DISTRIBUTION 
OF  EXTREME  VALUES  AND  ORDER  STATISTICS 

i  Emanuel  Parzen1 

•  Texas  A&M  University 

Abstract^ This  paper  presents  ideas  leading  to  unified  formulas  for  exact  and  asymptotic 
distributions  of  central  and  extreme  order  statistics  of  random  samples  and  stationary 
time  series  X(i)  in  terms  of  quantile  functions  Q(u),  density  quantile  functions  /Q(u), 
left  and  right  hazard  quantile  functions  hQ(u),  defined  to  equal  respectively  fQ(u)/u  and 
fQ(u)/(l—u),  tail  exponents,  and  the  spectral  density  near  zero  frequency  of  a  two-valued 
time  series  e(X(t );  u),  equal  to  1  -  u  or  -u  as  X(t)  <  Q(u)  or  X(i)  >  Q(u).  v - " 

1.  Introduction. 

1.0.  Quantile  function  concepts  are  reviewed  (Section  2).  Section  3  reviews  important 
sampling  distributions.  Sections  4  and  5  outline  the  asymptotic  distribution  theory  of 
extreme  and  central  order  statistics.  The  unified  formula  is  presented  in  Section  6;  com¬ 
parisons  of  exact  and  approximate  distributions  are  made  numerically.  Section  8  outlines 
extensions  to  stationary  time  series. 

1.1.  A  random  sample  XlfX3,...,Xn  consists  of  independent  random  variables  iden¬ 
tically  distributed  as  a  random  variable  X.  Its  order  statistics,  denoted  X(l,n)  <  •••  < 
X(k\n)  <  *••  <  X(n\ n)  are  the  values  in  the  sample  arranged  in  increasing  order.  The 
sample  minimum  and  maximum  are  respectively  equal  to  X(l;n)  and  X(n;n). 

1.2.  Extreme  value  theory  initially  seeks  to  identify  the  distributions  which  can  arise 
as  the  distributions  of  X(l;n)  and  X(n;  n)  for  large  n  (in  the  limit  as  n  — ►  oo).  It  is 
motivated  by  the  hope  that  these  distributions  will  provide  models  for  variables  observed 
in  reliability  studies.  We  present  the  theory  of  extreme  value  distributions  in  a  way  that 
broadens  our  understanding  of  the  models  to  fit  to  data. 

1.3.  Extreme  value  theory  does  not  seem  to  be  routinely  studied  by  statisticians.  The 
literature  on  extreme  value  theory  is  extremely  large  and  does  not  seem  to  be  easy  to 
apply.  This  paper  proposes  that  the  quantile  function  descriptions  of  probability  laws  can 
be  used  to  develop  a  unified  theory  of  the  distribution  of  order  statistics  in  large  samples 
which  is  easier  to  apply  (and  even  to  teach  in  introductory  courses). 

1.4.  In  the  quantile  based  approach  a  central  role  is  played  by  the  order  statistics 
U(l;n)  <  •••  <  U(n;n)  of  a  random  sample  of  a  random  variable  U  which  is  Uniform 
[0,1].  Their  exact  and  asymptotic  distributions  are  easily  found. 

1.5.  A  philosophical  note:  Our  interest  in  order  statistics  is  motivated  in  part  by:  (1) 
a  conjecture  that  statistical  inference  is  made  possible  by  the  fact  that  order  statistics  pos¬ 
sess  asymptotic  distributions  whoee  parameters  are  the  quantile  function  and  the  density 
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quantile  function,  and  (2)  a  ‘definition’  of  statistics:  ‘Arithmetic  done  by  ranking  before 
adding’. 


How  should  one  find  the  average  X  of  ■X’(l),  •  •  • ,  X(n);  the  conventional  answer  is  add 
and  divide  by  n: 

«=  i 

Our  answer  is:  rank  (form  order  statistics)  and  then  add  to  form 

X=\Y,X(k-,n). 

*  -  l 


We  argue  that  ranking  before  adding  to  find  X  is  better  in  both  practice  and  theory.  In 
practice  the  ranked  values  can  identify  the  types  of  distribution  obeyed  by  the  sample.  In 
theory,  to  study  the  asymptotic  distribution  of  X,  one  has  to  determine  the  contribution 
of  the  extreme  values,  and  in  particular  if  their  distribution  will  be  such  as  to  make  the 
distribution  of  X  non-normal. 

1.6  The  goal  of  this  paper  is  to  state  a  unified  formula  for  distribution  of  extreme 
and  central  order  statistics.  It  can  be  summarized,  using  the  notation  in  Table  1.  For 
0  <  uh;n  <  0.5,  approximately. 


For  0.5  <  ufc;B  <  1,  approximately, 


/«*(.,.>(«) = {„  r!  s'  /«(•».•)♦*- * w 


The  shape  of  the  distribution  of  order  statistics  can  be  determined  from  graphs  of  the 
approximate  formulas  for  their  density  quantile  functions.  To  apply  these  results  in  practice 
one  has  only  to  determine  the  values  of  oto,  a„  m0,  mx.  It  should  be  noted  that  these 
approximations  are  not  valid  in  the  extremes  (as  u  tends  to  0  or  1)  because  their  tail 
exponents  are  not  exact.  The  scale  parameters  for  extreme  values  of  these  approximating 
distributions  are  the  reciprocals  of  the  hazard  factors  in  Table  2. 


Table  1:  Table  of  Notation 


Notation 


Meaning 


F(i) 

/(*) = n*) 

Xi Xn 

X(fc;n),X(n  +  k  -  1;») 

«*:»  =  k/(n  +  1) 

U(k,n) 

<?(«)  =  F-'{u)JQ(u)  =  f(Q(u)) 
Q(Beta(Jfc,  n  +  1  —  A;)) 

(n  +  1)Z7(Ae;  n)  -►  Gamma(fc) 
GammaA(m)  =  ^Gamma(m) 

+ 


Distribution  Function 
Probability  Density  Function 
Random  Sample 

Extreme  order  statistic  if  k  fixed; 
central  order  statistic  if  k  -*■  oo 
as  n  -+  oo  and  k/n  -*  p  >  0 

Order  statistic  from  Uniform  [0,  l] 

Quantile,  density  quantile  function 

Exact  Distribution  of  X(k;  n ) 

Asymptotic  Distribution  of  U(k\n) 

Wilson-Hilferty  Approximation 


_  1 

g(x;  A)  =  — - — ,  A  <  0  or  A  >  0 

=  log  x,  A  =  0 

ff  (GammaA(m);  A) 

X(*;n)  =  Q(«fcif.)+W(Jb;n) 

Asymptotic  distribution  of  X(k;n) 

W(k;n)  =  {hoQ(u*;,»})'I0(GammaA(mo);l 

W(k,n)  =  {fciQ(u*;„)}"ly(GammaA(m1);l 

/ioQ(u)  =  fQ{  u)/u 

kiQ{u)  =  fQ{u)/(l  -  «) 

/Q(u)  =  «-Lo(u) 

L0(u)  ~  (log  «)* 

/Q(i-u)  =  (i-«)*‘i;l(u) 

Li(u)  ~  ((log(l  -tt)-1)^ 


Power  (Box-Cox)  transformation 

PowerG(A,m)  distributed  variable 
X[k\n)  estimates  Q(uh.n) 

Oo)  0  <  u*;i»  <  0.5 
<*i)(-l)  0.5  <  uk.n  <  1 

left  hazard  quantile  function 
right  hazard  quantile  function 
Oo  left  tail  exponent 
L0(u)  slowly  varying,  log-like 
ax  right  tail  exponent 
Li  (u)  slowly  varying,  log  -like 


mo  =  (n  +  l)ttfcin/(l  -uhin) 


Table  2:  Examples  of  Powers  of  n  in  Hazard  Factors 


-<*X 

1 

Uniform,  Exponential  (left  tail) 

hoQ(l/n)AQ(l-(l/n)) 

n 

.5 

Weibull  (left  tail) 

n»  =  n0-8 

0 

Q(u)  =  (log(l  -  u)-1)',  0  =  0.5 
Weibull  (right  tail) 

0(lo*  »)•■• 

0 

Normal 

(2log  n)0  8 

0 

Exponential  (right  tail) 

1 

0 

One-sided  stable,  index  .5  (left  tail) 

0.5(log  n)a 

-.5 

Infinite  variance  Pareto  (right  tail) 

n-°.« 

-1 

Cauchy 

jrn*1 

-2 

One-sided  stable,  index  .5  (right  tail) 

(x/2)n-3 

«(.)  =  {*-■(!-»)}-> 

2.  Quantile  Functions  Concepts. 

2.1.  The  probability  distribution  of  a  random  variable  is  usually  described  by  the 
distribution  function  F(x),  —  oo  <  x  <  oo,  defined  by 

F(x)  =  Pr\X  <  x}. 

The  quantile  function,  denoted  Q[u),  0  <  u  <  1,  or  F~ 1  (u),  0  <  u  <  1,  is  defined  by 

Q(ti)  =  F~ 1  (u)  =  inf  (*  :  F(x)  >  u). 

2.2.  Theorem:  Inequality  Inverse  Functions.  F(x)  and  Q(u)  are  inverse  of  each  other 
under  inequalities,  in  the  sense  that  (for  any  x  and  u) 

F(x)  >  «  if  and  only  if  x  >  Q(u). 

2.3.  Examples  of  Probability  laws  are: 

1)  Standard  normal:  F(x)  =  ft(x),Q(u)  =  *_l(u),  where 

*(x)  =  J  ^(y)dy,  <t>{x)  =  --Lexp(-^xa). 

2)  uniform  [0,1]:  F(x)  =  x,  0  <  x  <  1,  Q(u)  =  «,  0  <  u  <  1. 

3)  Standard  exponential:  F(x)  =  1  -  e"“,  x  >  0,  and 

Q{u)  =  —  log(l  —  u),  0  <  u  <  1. 
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4)  Weibull  <J(u)  =  {-  log(l  —  u)}* ,  0  <  u  <  1. 

5)  Cauchy:  Q(u)  =  tan  tt(u  —  0.5),  0  <  u  <  1. 

2.4  In  this  paper  theorems  on  the  asymptotic  distribution  of  order  statistics  and 
extreme  values  assume  that  F(x)  is  continuous,  with  probability  density  f(x)  =  F'(x)  and 
hazard  function  h(x)  =  f(x)/(l—F(x)).  We  define:  g(u)  =  Q'  (u) , quantile  density  function; 
fQ(u)  =  /(Q(u)), density  quantile  function;  hiQ(u)  =  /Q(u)/(l  — u), right  hazard  quantile 
function;  hoQ(u)  =  fQ{ u)/u,  left  hazard  quantile  function. 

2.5  Tail  exponents  of  probability  laws  are  defined  in  terms  of  representations  of  density 
quantile  functions  as  regularly  varying  functions.  We  call  Oo  the  left  tail  exponent  if 

/<?(«)  =  tt-Lo(u) 

where  L0  (u)  is  slowly  varying  as  u  -*  0,  in  the  sense  that  for  every  y  >  0 

L0{yv)/L0{u)  — ►  1  as  u  -+  0. 


Typically,  L0  (u)  ~  (log  u)*  for  some  0. 
We  call  c*!  the  right  tail  exponent  if 


/Q(l -«)  =  ***  MO 


Accession  For 

NTIS  GRAtl 
DTIC  TAB 
Unannounced 


where  Li  (t)  is  slowly  varying  as  t  -*  0.  We  prefer  to  write  this  representation  justif loatio 


By - - - 

Dlstrl but  1 on/ 

Availability  Codes 
jAvaii  and/or 
31st  Special 


/g(«)  =  (1  -  tt)“  ‘  Lx  (u)  By _ 

©Dlstrlbutl 
AvaTlabll 

Dlst 

fc0g(tt)  =  tta#-iL0(tt) 

hiQ(u)  =  (1  -u)*1'1!,^).  _ 

We  call  slowly  varying  functions  L(u)  log-like. 

2.6  Left  tails  of  a  distribution  are  classified  as  short,  medium,  or  long  tail  if  a<,  <  1, 
Oq  =  1»  Oq  >  1  respectively.  Right  tails  are  classified  as  short,  medium,  or  long  tail  if 
at}  <  1,  a*  =  1,  ai  >  1  respectively.  Examples  of  tail  classification:  Normal:  00=0!=  1; 
Exponential:  a<,  =  0, 0^  =  1;  Uniform:  Oo  =  a:  =  0;  Cauchy:  a<>  =  aj  =  2. 

2.7  Power  transformations  (also  called  Box-Cox  transformations)  are  defined  to  be 
following  functions  of  x  >  0: 


g{x\  A)  = 


xx  —  1 

“IT"’ 

logx, 


A  0 
A  =  0 
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A  power  transformation  g{x\  A)  is  increasing  in  x,  and  at  x  =  1  has  value  0  and  slope  1. 
Note  that  g'(x;  A)  =  xx~1 . 

Important  classes  of  distributions  are  the  power  uniform  and  the  power  exponential. 
A  random  variable  X  is:  (1)  power-uniform  if  for  some  constant  A 

X  =  g{U;X)  or  X=-g(U;X) 

where  U  is  uniform  [0,1];  (2)  power-exponential  if  for  some  constant  A 

X  =  g{W,X)  or  X=-g(W;X) 
where  W  is  Exponential  with  mean  1. 

The  three  types  of  extreme  value  distributions  are  actually  an  infinite  family  of  power- 
exponential  distribution.  The  case  of  A  =  0  is  of  great  theoretical  interest,  but  may  not 
be  useful  in  practical  model  fitting. 

It  should  be  noted  that  Tukey  A-family  of  distributions  is  defined  to  have  quantile 
functions 

g(u)  =y(u;A)-p(l-u;A) 

which  are  functions  of  u  on  0  <  u  <  1  which  are  symmetric  about  u  =  .5. 

Power  transformations  play  a  central  role  in  extreme  value  distribution  theory  be¬ 
cause  of  the  quantile  tail  properties  that  can  be  established  for  regularly  varying  quantile 
functions  with  tail  exponents  Oo  and  . 

2.8  Quantile  Tail  Value  Property.  Regularly  varying  quantile  functions  are  power- 
uniform  in  the  tail. 

For  any  sequences  tn  -*  0,  yn  -*  y  >  0,  as  n  tends  to  oo  : 

0o.„(y»)  -*■  -y(y;i-«,) 

defining  the  tail  quantile  functions 

g0,n(y)  =  hoQ(tn){Q(tny)  ~  Q{tn)) 

Si,n(y)  =  -  t»)(Q(i  -  t»y)  -  Q(i  - 1„)) 

One  can  show  that  g0i„  (u),  0  <  u  <  1,  is  the  quantile  function  of  the  random  variable 
censored  at  the  left  tail 


y  =  ih«(t.){x- <?(<„)},  a-  <  <?(*„) 


with  distribution  function,  on  -oo  <  x  <  0, 


Fr(x)  =  F(Q(tn))  + 


To  prove  the  quantile  tail  property  one  writes 

*.«(*.)  («(«.»)  -  <?(!„))  =  r  /'"{/<3(i.)//«W}^ 

•»  /t, 

=  r  im.)/fQ(zt.)}dz 

=  J  z—iz  +  o(t.) 

The  rate  of  convergence  of  L0(tn)/L0(ztn)  to  1  determines  the  rate  of  convergence  of  the 
quantile  tail  property  which  is  the  most  important  factor  affecting  the  rate  of  convergence 
of  extreme  values  to  their  limit  distributions. 

DeHaan  (1985)  has  shown  that  in  order  that  extreme  values  from  a  distribution  with 
quantile  function  Q(u)  have  limiting  distributions,  it  is  necessary  and  sufficient  that  Q(u) 
obey  the  above  conditions  (for  suitable  sequences  replacing  hoQ(tn)). 

DeHann  shows  (p.33)  that  a  distribution  function  F  satisfies,  for  some  choice  of  con¬ 
stants  a*  >  0  and  bn  (n  =  1, 2, . . .) 

(1)  F"  (a*  x  +  6* )  — »  G(x) ,  extreme  value  distribution, 
weakly  (as  n  tends  to  oo)  if  and  only  if  for  a  positive  function  a(t) 

<3(1  -  <v)  -  9(1  - 1)  ,  „ 

- 40 - «**) 

The  proof  consists  of  the  following  steps:  (1)  is  equivalent  to 

(2)  n{l-F(anx  +  &„)} -♦ -logG(x); 

—  log  Gr(x)  has  inverse  function  (of  y)  equal  to 

-Ag{y\  A  )  +  B 

for  some  constants  A  >  0,  B  and  A  real  since  if  G(x)  is  distribution  function  of  —g{W ;  A), 
W  exponential  with  mean  1, 

G(x)  =  PROB[W  >  0_1(-*;A)] 

=  exp(-0"l(-x;  A)), 

—  log  G(x)  =  y-l(— x;  A); 

the  inverse  of  y  =  n{l  -  F(o*x  -I-  &„)}  is 

n 


r*.»  » 


F  ■  r*,r*  l 


(2)  is  equivalent  to  convergence  of  the  inverse  functions: 

(3)  i{<J(l-£)-U-.-A»(v;A)  +  B; 

**» 

subtracting  (3)  for  y  =  1,  one  obtains 

i-W(l-*)-<}(l-i)>--A((y;A) 

On  n  n 

We  will  apply  the  quantile  tail  property  with  y„  =  (l/tn)Qu(k;n)  (u)  and  y  =  Qa  (u). 
We  use  the  following  general  property  of  convergence  in  distribution  (called  Rubin’s  theo¬ 
rem:  see  Fabian  and  Hannan  (1985),  p.  142). 

Theorem:  If  gn  (yn )  — ►  g(y)  for  every  sequence  y*  — ►  y,  and  if  Yn  -*  Y  in  distribution, 
then  y»(Tj,)  — »  g{Y)  in  distribution. 

2.9  A  powerful  property  of  quantile  functions  is  how  they  behave  for  monotone  func¬ 
tions  of  random  variables. 

Theorem:  Y  =  g(X)  has  Qy  (u)  =  g(Qx  («)),  assuming  y(z)  non-decreasing  and 
continuous  from  left  (which  is  true  for  a  quantile  function  Q(u)).  Note  that  when  g  is  non¬ 
increasing  and  continuous, 

Y  =  g(X)  has  Qr  (u)  =  g(Qx  (1  -  u)) 

2.10  The  quantile  representation  of  X  in  terms  of  U  is  another  powerful  property  of 
quantile  functions. 

Theorem:  Quantile  Representation.  Q(U)  =  X  where  =  denotes  identical  distribution 
(equality  in  distribution),  and  U  is  uniform  [0,  lj. 

Proof.  We  outline  two  proofs  of  the  quantile  representation.  From  the  formula  for  the 
quantile  function  of  a  function  of  a  random  variable 

QomM  =  Wu(tO)  =  Q(u)  =  Qx  (u). 

Alternatively,  Q(U)  <  x  if  and  only  if  U  <  F(z).  Consequently, 

Pr[Q(tf)  <  x]  =  Pr[tf  <  F(x)]  =  F(x) 

2.11  The  foregoing  results  yield  an  important  theorem  relating  the  distribution  of 
general  order  statistics  to  uniform  order  statistics. 

Theorem:  X(k’,n)  =  Q{U(k]n)), 


Qx(fc;n)(t*)  -  Qo(U(fc; n))(u) 


2.12  Identification  Quantile  function  QI(u),0  <  u  <  1,  is  defined  by 

QJ(u)  =  (Q(u)  -  MQ)/DQ 

where  MQ  =  Q(.5)  is  the  median,  and  DQ  =  2(Q(.75)  —  Q(.25))  is  the  quartile  deviation. 

The  values  of  QI(u)  for  u  near  0  and  1  can  be  shown  to  provide  quick  diagnostics  of 
the  tail  classification  of  the  distribution. 

2.13  Location  -  Scale  parameter  models  for  probability  distribution  are  expressed  in 
terms  of  quantile  functions  by 

Q(u)  =  p  +  oQ0(u) 

where  Q0  (u)  is  a  specified  Standard  quantile  function.*  We  propose  that  if  one  always 
chooses  Q0  (u)  =  QI( u),  then  one  can  more  easily  compare  p  and  o  corresponding  to 
different  models. 

An  alternative  normalization  is  the  unitized  quantile  function 

Ql(u)  =  (Q(u)  -  MQ)/q{ 0.5) 

The  corresponding  probability  density  function  fx  (x)  has  the  property 

fi  (median)  =  1. 

For  a  normal  distribution  Ql(ti)  has  probability  density  A(x)  =  e-*’*. 

3.  Beta,  Gamma,  Power  Distributions. 


3.2  Gamma  distribution.  Gamma(m)  where  m  >  0,  has  probability  density 


Note  that  Ganuna(l)  is  the  exponential  distribution.  We  define 


GammaA(m)  =  — Gamma(m) 
m 

3.3  PowerG(A,m)  distribution,  where  m  >  0  and  — oo  <  A  <  oo,  is  defined  by  a 
random  variable  representation: 

PowerG(A,  m)  =  g(GammaA(m);  A) 

For  m  =  1,  this  is  the  power-exponential  distribution. 

The  quantile  function  of  the  distribution  is  given  by 


(1) 


QvowttO  ( A,m  )  (^)  9{.  Qo»mm»(m)  (**)>  A) 


m 


Note  that  Qa,Bm»(m)H  does  not  have  an  analytic  formula  (for  m  not  equal  to  1)  but  is 
computed  numerically. 

3.4  NegPowerG(A,m)  distribution  is  defined  by 

NegPowerG(A,m)  =  —  ff(GammaA(m);  A) 

Qv  efPow*rO(i,m)  (®)  =  j((l/*^)Oo«niB»|n)  (l  ^)i  A) 


3.5  Wilson  Hilferty  transformation  of  Gamma(m)  to  normal  is  an  approximation  for¬ 
mula  for  quantiles 


Qo>nn>A(ffl)(^j  Qoima>(n)M 


Graphs  of  exact  and  approximate  quantile  functions  indicate  how  well  the  approxi¬ 
mation  works,  even  for  m=l;  see  Figure  1. 

3.6  Quantile  function  of  PowerG  is  approximated  by  substituting  (2)  into  (1). 

4.  Exact  and  Asymptotic  Distributions  of  Extreme  Order  Statistics. 

4.1  The  exact  distribution  of  X(k',n)  is  expressed  in  terms  of  quantile  function  Q(u) 
and  the  quantile  function  of  U{k\n): 

Qx(fcin)(«)  =  Q(Qu(fc;r,)(u)) 


The  quantile  function  of  U(k]n)  is  obtained  numerically  from  the  distribution  function  of 
the  Beta  probability  density. 


4.2  Theorem:  U(k;n)  is  Beta  (k,n  +  1  —  k ): 


fu(k;n)(*)  = 


B(k,  n  +  1  -  k) 


z*-1(l  -z) 


n  —  k 


0  <  x  <  1. 


We  write  this  fact  in  terms  of  quantile  functions: 


Qu(k;n)  (tt)  —  Qb  eta(k,n+l-k)  (u) 

Proof:  To  compute  the  probability  that  U(k;n)  is  in  (z,z+dz),  there  are  n  ways  to  choose 
a  variable  in  (z,  x+dx)  and  ("2  J)  to  choose  k  —  1  variables  in  (0,  z).  For  specified  variables 
lying  in  these  regions,  probability  of  event  is  z*-1(l  —  z)n_fe.  The  product  of  n,  (*!})> 
and  z*“ 1  (1  —  z)n~*  is  the  value  of  the  probability  density. 

Graphs  of  Beta  probability  densities  should  be  examined  for:  n  =  20,  k  =  1,2,...,  20; 
n  =  100,  k  =  1, ...,10,20, 30, 40,  50,60,70,  80,  90,  81,... ,100. 

4.3  The  mean  of  U(k;n)  is  the  expected  value  of  the  ktk  order  statistic  from  a  uniform 
[0, 1]  and  is  given  a  special  notation: 

E[U{k;n)}  =  =  «fc;„ 

The  covariances  also  play  a  central  role;  one  can  show  that,  for  j  <  k, 

Cov[f/(j';n),tf(*;n)]  =  — J— uy.„(l  -  ufc!B) 

n  ■+■ 1 

We  review  the  proof  of  the  formula  for  mean: 

E\U(k‘,n)}  =  J)  jf  zzfc_x(l  -z)n"kdz 

4.4  The  asymptotic  distributions  of  U(k;n ),  U(n  +  1  -  k;  n),  expressed  in  terms  of 
quantile  functions,  are 


i 
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QnU{k;n)(u)  Q  Gam  m  aUlWiQffUin)/.,;,  (*)  -  iQ  Gam  m  •(*)(“)» 

))(U)  Qa  am  m  a  {*)(«)• 

The  form  in  which  we  apply  these  results  in  Theorem  4.5  is 

^(n  +  -{{i/k)Gamma{k)  -  1} 

l-uB  +  1_fcin 


a 


A 


n 


These  important  relations  follow  (by  general  theorems)  from  the  fact  that  the  probability 
density  functions  obey  similar  limit  theorems  or  from  representations  of  uniform  order 
statistics  in  terms  of  consecutive  sums  of  exponential  random  variables.  Graphical  repre¬ 
sentations  of  this  approximation  are  given  in  Figures  2A-2B. 

The  proof  in  terms  of  density  functions: 


fnU(k;n){y )  —  ^ftj(k  ■,n){y/n) 


— ►  - li*  *• 

(k  —  l)i 

fn(l-U(n+l-k-,n))(y)  =  -/tf (»+l-kjn)(l  ~  ~) 

n  n 


(*  -  1)! 


y*  1  e  9 


4.5  The  exact  distribution  of  is  expressed  in  terms  of  quantile  function®. 

Qx(k;»)(u)  =  Q<J(0(k;»))(«)  =  Q{Q  B  et«(k,n+  i  -  k)  («)) 

The  asymptotic  distribution  of  X(k;n)  is  obtained  by  defining  uk;n  =  k/(n  +  1),  and 
writing 

*(*;»)  -  <2(«kin)  =  9(tf(*;»))  -  QK-. .). 

The  Quantile  Tail  Property  enables  us  to  show  that 

— ►  g(GammaA(k)-,  1  -  ccq) 

using  Rubin’s  theorem.  One  can  conclude  the  following  basic  theorem  on  the  distributions 
of  extreme  values  under  suitable  conditions  of  regular  variation  of  the  density  quantile 
function. 

4.5  THEOREM:  ASYMPTOTIC  DISTRIBUTION  OF  EXTREME  VALUES 
ho Q(uk;„)(X(fc;n)  -  Q(ufci„))  g(GammaA(k);  1  -  Oo) 

=  PoiverG(l  —  Oq  ,k)random  variable, 
hiQ{  1  -  «*;»)(*(«  +  1  -  k]n)  -  Q(1  -  u*.n)) 

- g (Gamma A(k)\  1  -  ar) 

=  NegPoxverG{  1  —  ax  ,k)random  variable 


This  basic  theorem  is  proved  in  “one-line”  from  the  quantile  tail  property,  Rubin’s 
theorem,  and  the  asymptotic  distributions  of  extreme  values  of  uniform  distribution.  It  is 
more  precisely  stated  in  terms  of  quantile  functions,  and  illustrated  by  graphs  comparing 
exact  and  asymptotic  quantile  functions  of  extreme  and  central  order  statistics  (to  be  given 
in  the  sequel). 

4.6  The  limit  theorems  that  we  have  given  for  extreme  values  are  stated  in  a  way  that 
will  enable  us  to  maximize  the  analogies  between  the  asymptotic  distribution  of  extreme 
and  central  order  statistics.  Our  goal  is  to  obtain  unified  formulas  so  that  we  will  not  have 
to  decide  in  practice  what  is  the  dividing  line  between  extreme  and  central  order  statistics. 
Order  statistics  X(k;n)  are  called:  extreme  if  k  is  fixed;  end  if  k/n  — ►  0  or  1  (as  n  — ►  oo); 
and  central  if  k/n  — ►  p,  0  <  p  <  1. 

5.  Asymptotic  Normality  of  Central  Order  Statistics. 


5.1  To  study  the  distribution  of  central  order  statistics  X(k;n),  we  introduce  sample 
quantile  function 

<3"(tt),0  <  u  <  1 

defined  by:  for  k  =  1,  ...,n 


<?-(«)  =  X(fc;n), 


k  —  1  k 

- <  u  <  - 

n  n 


Equivalently 


g»  =  F’-1(u) 

where  F~(x),  —  oo  <  z  <  oo  is  sample  distribution  function; 


F~{x)  =  —  for  X(k]n)  <  x  <  Xlk  +  l;n) 
n 


5.2  When  F  is  continuous,  a  basic  role  is  played  by  Bahadur  representation:  For 
0  <  u  <  1 

<?-(«)  -  <3(u)  =  -(F-«(u))  -  «)//<?(«)  +  R. 
n  5  Rn  4  0  as  n  -*  oo 

5.3  The  asymptotic  distribution  of  central  order  statistics  X(k\n)  is  obtained  from 
the  asymptotic  distribution  of  Q’[u)  since  Q'(u)  =  X(k\n)  for  ( k  —  1  )/n  <  u  <  k/n.  A 
value  of  u  in  this  interval  is  denoted  by  uk.n .  Typical  formulas  for  ut;n  are 

«fc:»=k/(n  +  l)  or  uk;n  =  (k  -  0.5)/n 

In  this  paper  we  use  the  first  definition. 

The  asymptotic  distribution  of  the  sample  distribution  function  can  be  written:  for 

0  <  p  <  1 

(n  +  ir{r(Q(p)-p))^(0,p(l-p)) 
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From  Bahadur’s  representation  one  obtains  the  asymptotic  distribution  of  the  sample 
quantile  function  which  can  be  written,  for  0  <  p  <  1, 

(»  + 1)°  ‘/<5(p){<3'(p)  -  9(p)>  =  N(0,p(i  -  p)) 

5.4  The  asymptotic  distribution  of  Q(p)  can  be  expressed  in  terms  of  hazard  quantile 
functions. 

(n  +  1)°  ‘/>o<30>){<?*(p)  -  9(p)}  =  N( 0,  (1  -  p)/p),0  <  p  <  0.5; 

(»  +  1)°  ‘/.,Q(p){l}-(p)  -  «(p)}  =  N(0,p/(1  - p)),0.5  <  p  <  1 

To  unify  the  theory  of  central  and  order  statistics  we  propose  an  alternate  approximation 
which  is  equivalent  in  the  limit  to  the  above. 

6.  Unified  formula  for  asymptotic  distribution  of  extreme  or  central  order 
statistics  X(fc;n).  . 

Define  ufc;n  =  k/(n  +  1), 

mo  =  (n  +  l)u*;n/(l  -  ttfcin),  mt  =  (n  +  l)(l  -  ufc;n)/ufc;n 
For  0  <  «*.„  <  0.5, 

Ao <?(«*;,. KQ'O**,,.)  ~  =  g (Gamma Afrrio);  1  -  oto) 

For  0.5  <  <  1 

fciQKjnJiQ'K;,)  -  <?(«*•„)}  =  -giGammaA^),!  -  a t) 

This  formula  is  stated  in  terms  of  random  variables.  For  numerical  calculations  it  is 
easiest  to  write  it  in  terms  of  quantile  functions:  For  0  <  ufc.„  =  k/(n  +  1)  <  0.5 

Qx(fc;«)(«)  =  +  Qo(tt), 

Qo(u)  Q(uk  )  at)  (U) 

For  0.5  <  uh.n  =  k/(n  +  1)  <  1 

Qx(fc;n)(«)  =  +  Ql  («) 

Ql(u)  =  {/»iQ(ufc.,,)}  1{  Qf(OtmmiA(mi);l-Oi)(l  ~  u)} 

When  applied  to  extreme  order  statistics  (say  X(k;n)  for  k  fixed)  the  unified  formula  uses 
as  the  Gamma  degrees  of  freedom 

=  ~ 

n  +  i 

rather  than  k.  When  applied  to  central  order  statistics  the  unified  formula  uses  a  Pow- 
erGamma  distribution  rather  than  the  normal  distribution. 

To  empirically  demonstrate  the  practical  use  of  the  unified  formula,  one  compares  the 
quantile  function  of  the  exact  distribution  of  X(k;  n), 

Qx(*;n)(u)  =  Q(Beta(k,  n  +  1  -  k)) 

with  the  approximate  distribution  given  by  the  unified  formula.  The  examples  we  have 
considered  are  n  =  20, k  =  1,...,  10  and  n  =  100,  k  —  1, ...,  10,20,30,40, 50  for 


Table  S:  Distributions  Used  to  Compare  Exact  and  Approximate  Density  Quantile 
Functions  of  Order  Statistics 


Uniform, ato  =  0, 

Q(u)  =  u 

hoQ(a)  =  u_1 

Weibull,  Oo  =  1-0 

<3(u)  =  {Iog(l -«)•*}' 

0  =  1/6, 2/6, 3/6, 4/6, 5/6 

hoQ(u)  =  J*"1^  -  tt){log(l  -  u)-*}1-' 

Cauchy  Oo  =  2 

Q{u)  =  tanx(u  -  .5) 

=  —  cos  xu  fain  xu 

AoQ(u)  =  {sin(7ru)}2/xu 

1)  Cauchy  distribution  Q(u)  =  tanx(u  -  0.5),  see  Figures  4A-4D; 

2)  exponential-type  Q(u)  =  logu, 

3) <?(u)  =  (logtt)06 


7.  Approximate  density  quantile  function  of  order  statistics. 

The  density  quantile  function  fQx  (*;»)(u)  of  the  probability  distribution  of  X(k ;n) 
can  be  obtained  from  the  quantile  function  Qx(t;fl)(u)  by  the  formulas 


9x(h;n)(tt)  -  <&(fc;1*)(u),  fQx(k-,n){v)  =  fx(fc(»)(U) 


One  obtains  in  this  way  the  following  approximate  formulas.  Let  uk.n  =  Jb/(n  -f  l),mo  = 
(n  +  l)ufciB/(l  (n  +  1)(1  -  u*:*)/ufc:„.  For  0  <  <  0.5, 


fQx  (*<t)(u)  =  y/rn^ho  Q(tt*;i,)^*~ 1  (u) 

(1  + 


For  0.5  <uk.n  <  1 


fQx(k;n)[v)  = 


(1  + 


1 


*-‘(1  -«)-—)» 


•»  -2 


We  graph  the  unitised  density  quantile  functions  / Q(u) /  fQ(.S),  and  record  the  value  of 
/Q(-5),  for  the  cases  listed  in  Table  3.  The  accuracy  of  the  approximation  can  be  judged 
by  comparing  (see  Figures  3A-3B)  the  unitised  approximate  density  fQxn  «)(**)  of  the 
sample  minimum  with  the  unitised  density  quantile  of  the  Weibull  distribution  (which  is 
the  usual  formula  for  the  asymptotic  distribution): 


fQ{ «) 


(l-«) 


{log(l  -u)  ,}‘ 


- » 


a 


8.  Stationary  Time  Series. 


When  the  sample  X(l), . . . ,  X(n)  is  from  a  stationary  time  series  we  can  represent 

tm  1 

defining 

c(x;p)  =  l-p  if  x  <  Q(p) 

=  -p  if  x  >  Q(p) 

Its  asymptotic  distribution  can  be  written 

(n  + 1  )°‘(f(<)(p))  -  P)  =  AT(o,p(i  -  p)A(-;p)) 

n 

defining  fa(u;p)  to  be  the  spectral  density  function  of  the  two  valued  time  series  c(JT(t);p). 
The  time  series  concepts  are  illustrated  in  Figures  5A-5C.  The  spectral  density  /(w),  0  < 
w  <  1,  of  a  stationary  time  series  Y(t)  with  correlation  function  p(v)  =  Corr[y(i),F(t+t;)] 
is  defined  by 

oo 

/(«*>)  =  51  exP{~2*'VUJ)p{v). 

•  ■  -  OO 

The  formulas  for  asymptotic  distribution  of  order  statistics  continue  to  hold  if  we 
define 

"*0  =  {/«(£;  tt**)}-1  (»  +  l)ttfc;f,/(l  —  «*;„), 

m»  (n+l)(l-ufcin)/u*.n 

We  conjecture  that  these  formulas  continue  to  hold  also  for  extreme  value  statistics. 
If  the  conjecture  is  true,  extreme  value  statistics  of  stationary  time  series  have  unimodal 
distributions.  We  need  to  investigate  how  to  compare  this  result  with  the  asymptotic 
distribution  of  extreme  values  (in  terms  of  waiting  times  of  compound  Poisson  processes) 
found  by  Leadbetter  and  Hsing. 
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FIGURE  CAPTIONS 


Figure  1.  Exact  (solid  curve),  and  Wilaon-Hilferty  Approximations  (dotted  curve)  to, 
quantile  functions  of  Gamma  distributions  with  1,  2,  and  3  degrees  of  freedom. 

Figures  2A,  2B.  Exact  beta  and  approximating  gamma  distributions  of  X(l;20)  and 
X(2;20)  from  uniform  distribution.  Graphs  of  density  functions,  identification  quantile 
functions,  and  unitized  density  quantile  function. 

Figures  3A,  3B.  Exact  and  approximating  unitized  density  quantile  functions  of  X(l;  100), 
X{2\  100),  X(3;  100)  from  Weibull  g(tt)  =  {-  log(l  -  u)}' ,  0  =  .333  and  /?  =  .167. 


Figures  4A,  4B,  4C,  4D.  Exact  and  approximating  quantile  functions  and  unitized  density 
quantile  functions  for  Jjf(l;100),  X(2;100),  Jf(3;100),  *(4;  100),  X(20;100),  AT(30;100), 
X(40;  100),  X(50;  100)  from  Cauchy  distribution. 


Figures  5A,  5B,  5C.  Time  series  analysis  of  samples  of  length  n  =  100  from  AR(1)  model 
Y(t)  -  .95  Y(t  -  1)  =  e(t)  clipped  at  sample  Q(u),  u  =  .5  and  u  =  .975. 
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